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A.  Original  German  passage  25 

Wenn  ein  modernes  Elektronenubermikroskop  hinsichtlich  seiner 
optischen  Leistungsfahigkeit  beurteilt  wird,  so  steht  fast  immer 
die  Frage  im  Vordergrund,  bis  zu  welcher  Grosse  herab  kleine  Ein- 
zelheiten  in  getreuer  Form  dargestellt  werden. 


B.  German  Passage  Rearranged  for  English  Translation 

Wenn  ein  modernes  Elektronenubermikroskop  x^ird  beurteilt  hinsichtlich 
seiner  optischen  Leistungsfahigkeit,  so  die  Frage  steht  fast  immer 
im  Vordergrund,  bis  zu  welcher  Grosse  herab  kleine  Einzelheiten 
V/erden  dargestellt  in  getreuer  Form. 
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MASSACHOSfflTS  IHSTI2U2E  OF  ESCH5QLQGY 
Conference  on  Mechanical  Translation 


Yeu  are  cordially  invited,  to  attend  the  public  opening  session 
of  ths  Conference  on  Hocnanical  Translation  onJaae  17,  1252  at  8?00  F.r. 
la  Huntington  Hall,  10-250. 

This  3assion  will  have  the  fcrE  of  a  symposium.  Five  speakers 
Mill  talk  on  various  non=  technical  aspects  of  mechanical  translation  followed 
&y  a  tr^e  sticn.=an&~ansv'er  period  for  which  all  active  participants  of  the 
conference  Hill  serve  as  a  panel. 

Only  slight  changes  are  expected  in  the  following  program. 

Professor  J„  B„  Wiesner,  Director  of  the  Research  Laboratory  of  Electronics,  M.  X.  T«.  — 

Moderator 

hp.  Y.  Bar-Hillel,  Research  Associate,  Research  Laboratory  of  Electronics,  K.  I.  S.  —  — 

Heeds  and  possibilities  for  meci  animal  translation. 


Professor  Leon  Bostert,  Georgetota  University,  WashiEgton,  D.  C. 

Human  translation  versus  machine  translat: 

Dr.  Olaf  Eelmer,  Rand  Corporation,   Santa  Monica,   3allforida 

The  structure  of  the  problem  of  mechanical  transl  tion. 

Dr.  A.  D„  Booth,  Electronic  Computer  Section,  BirSfbsok  College,  Londo  .,  EngL 

How  intelligent  can  a  machine  trrjislator  be? 

Dr.  Jamas. tf.  Perry,  Center  of  International  Studies,  M.I.T. 

Machine  techniques  for  Index  searching  and  trai>.sl  .tion. 


Y.  Ear~Hill€  .,  Chairmen 
Organising  C  >mmitteo 


y-^n-3.^2, 


Reprinted  from  Philosophy  of  Science 

Vol.  20,  No.  3,  July,  1953, 

Printed  in  U.S.A. 


2 


c 


SOME  LINGUISTIC  PROBLEMS  CONNECTED  WITH 
MACHINE  TRANSLATION1 

YEHOSHUA  BAR-HILLEL 

During  my  recent  work  on  machine  translation  (3),  I  have  come  across  many 
problems  of  a  linguistic  nature  that  should  be  of  general  methodological  interest. 
Some  of  these  problems  have  never  been  treated  extensively  before.  Others  that 
have  been  discussed  previously  appear  now  in  a  different  and  rather  interesting 

light. 

The  task  of  instructing  a  machine  how  to  translate  from  one  language  it  does 
not  and  will  not  understand  into  another  language  it  does  not  and  will  not  under- 
stand presents  a  real  challenge  for  structural  linguists,  in  that  their  thesis  that 
language  can  be  exhaustively  described  in  non-referential  terms  undergoes  here 
an  experimcntum  cracis.  If,  in  a  translation  program,  some  step  has  to  be  taken 
which  directly  or  indirectly  depends  upon  the  machine's  ability  to  understand 
the  text  on  which  it  operates,  then  the  machine  will  simply  be  unable  to  make 
this  step,  and  the  whole  operation  will  come  to  a  full  stop.  (I  have  in  mind  present 
day  machines  that  do  not  possess  a  semantic  organ.  The  situation  will  change 
in  the  not  too  distant  future.) 

I  intend  to  deal  with  four  specific  problems,  of  which  the  only  obvious  common 
feature  is  the  decisive  role  which  they  play  in  machine  translation.  The  problems 
are,  in  the  order  in  which  they  will  be  treated: 

1.  Operational  Syntax 

2.  Intertranslatability  of  natural  languages 

3.  Idioms 

4.  Universal  syntactic  categories 

1.  Operational  Syntax.  One  of  the  decisive  steps  in  certain  methods  of  machine 
translation  is  the  determination  of  the  syntactic  structure  of  any  given  sentence 
in  the  source-language  (i.e.,  the  language  from  which  we  translate)  to  a  required 
degree  of  explicitness.  Since  thinking  in  terms  of  machines  might  perhaps  be 
difficult  for  the  reader,  let  him  imagine  an  utterly  moronic  student  without  the 
slightest  knowledge  of  either  the  source-language  or  the  target-language,  i.e., 
the  language  into  which  the  given  text  is  to  be  translated,  and  with  an  extremely 
restricted  understanding  of  his  own  native  language,  but  with  the  following 
abilities  which  are  rather  remarkable  for  a  human  being  with  such  a  constitution: 
he  is  able  to  identify  the  letter  shapes  of  the  source-language,  he  has  an  unfailing 
and  unlimited  memory,  and  he  is  extremely  fast  in  carrying  out  those  instructions 
which  are  formulated  in  that  small  language  fragment  he  understands.  I  shall 
not  go  into  the  detailed  specification  of  these  instructions.  Let  me  mention  only 
the  two  most  important  operations  he  is  able  to  carry  out:  matching  the  given 
text  or  any  part  of  it  with  any  of  a  number  of  fists  presented  to  him,  and  counting. 
Though  it  might  appear  as  if  these  operations  were  quite  restricted  in  their 

1  This  work  has  been  supported  in  part  by  the  Signal  Corps,  The  Air  Materiel  Command, 
and  the  Office  of  Naval  Research,  and  in  part  by  the  Rockefeller  Foundation. 
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(Victor  A*  Oswald,  Jr0) 

Tihea  I  learssd  teat  I  had  been  suESiOKQd  to  addrore  myself  te  the  topi*  " 
of  word-by-word  translation  I  felt  lib*  a  geographer  *OTitecl  to  r:  teuss  the 
utility  of  tho  sorption  that  the  wor?,d  is  flat*     In  short,  I  oar  only  «ey 
ttat  word-byword  tra&sZatioe  is  sot  possible*  if  w«  are  to  ozserrtand  by 
the  term  a  wordwiso  transverbalisstxoa  frm  one  language  into  another, 
partieularly  from  German  icfco  English. 

lb  was,  indeedc  the  diecorory  of  th*  imposaiblli^r  of  wordalee  ti 
lation  that  prosptod  the  syntactical  lcroetigation  outlined  la  Proposals  for 
the  Mqchaaioal  Resolution  ^  Seaman  Syrtaa  Brtfterns.     The  research  of       t 
summer  of  29  50  was  begun  by  translating  word  by  word  into  English  various 
German  texts  in  ths  field  of  ffiathematics.     Our  efforts?  rapidly  ease  to  griof, 
chiefly  beeause  of  tho  lamentable  faot  that  -fee  Seaaaa  "articles*  *re  also 
'Vords",  but  words  of  Rroteaa  traasfbnsations  *fcen  carried  over  into  I&iglifjii. 
Talcs  the  harmless-looking  little  form  der .    A  word-by-word  transwcrballsatioa 
into  English  waild  require-  to  be  eoaplete,  a  listing  of  tho  following 
possibilities,     "the"  (dar  Maoa)}  "of  the"  (der  Frmi  or  dsr  Franen)i  "to  the, 
.fbr  ttie"  (also  der  j*tm)?  "he"  (d^;  kofflmt  nieht)t  "hcr„  to  her"  (car  gob^ 
ich*8.  nie)*  Vho"  ( der  Kaan,  der  kommfe  ...)|  "sfcom"  ( die  u*m ,  der  ieh  es  gab) 
—  and  diverse  other  more  subtle  variants.     2hs»  oSner  forcis  of  tir    "ar^iolc:" 
require  almost  ail  orally  eeaples  tranimjrbalisaticns  n    73i*n  to  <  ireum- 
staaees  such  as  thase  there  is  added  ths  distressing  oddity  of  Seiman  word 
order,  word-by-word  translation  from  German  into  English  beenaas  cither  a 
jest  or  a  horror.     To  be  qaite  sure  of  the  impossibility  of  wordw-se  trans- 
lation, we  concocted  diverse  multiple- choice  traaslatioaa prim^tiiie 

ancestors  of  Mr.  Bar-Hillel»s  appandix  to  "The  Present  State  of  R<  search  on 
Iff"  —  and  -these  translations  we  submitted  to  our  naihematieal  colleagues. 


y_^Hi42^ 


IVicto:/  ,\.  C :.■•- ■■:•...-._.   ;"">0; 

Ao  I  have   7.i.u  '..-..:>:■        "     i?i  persuaded  that  \te  sust  ieuise  "' 
pr^dti   ::-  oi  KU  in  soa>  thV:.  w:  .' 

■  p:  less  iassdiotol  .   LI  .gii  Is 

:    '■  List  ir?   kbe  field  the  tranelat  d    ;ext   y  ertalr  3„  .       .  -  .; 

?r*3-«cd:  -       '/  kh  L     -  tal  3     tl      /      can  pe:  i  11       .  -  -.-\  '.  •.  •:- 

-•3  ;'"r"    ™i   1  see,    ;.  ."  101O  ^;e:i;::  :-..V;>  among  :-  duties  thai 

-    i  ■•'  fins  E.cture  surfaces  -  nan  oojs>- 

/   /:■■"    "  -      ■■-'.-■-•1-v- .  aal  in  the  '  '.'  :   b-3  re 

tc    sha;  '        I'oizgbrhefrT.,  in  par--:  out, 

'••        "  '"  "'-■  Lr<     '   ■"  hsve  .•-.-      Aba       all,  X- she 

'■  ■        ■■-  ■'  to  =    .      ■■  .."..'■  ,  satisfactory 

-    from  the  bs  -.v.  U  S:  '  :1  ;ha  "Li        sac! 

prepo  Ln   fch      -:-.    ;         \::i  ::/;;:;;•.?  to     "  ,  •.  -     .      late  equiv  lent 

asro:       the  less    '.'  ■  - \ . :.. ;  of  m?iLtiple  chc  Lc   -     .:.:     soaj 

ol       ■  ';.  gsr      ■'  prr-editox-.   however,  cennot  effici  In  the 

::■:;•-"••■  that  have  dives  mee  in 

diverse         bests      fc  monolii  71a]    sost-editor  ••rill  in 

'•riLth  all  sorts  of  conte^  '   h  he  is  net  familiar,  l?.s,  as  1 

have  pointed  out,  will  ;  cr  a  littxe  better  equip   3d  to  eorm  with  the 

problejSo     ?or,  if  we  assume        osj   resolve  patterns  of  syntactic  connection 

TL  —  as  I  aw  3ui*e  we  can  — -  and  if    re  a  ~:~v  ;s 
sen  supply  at  a  reasonable  speed  f!1  the  possible  significance     in  the  TL  for 
each  mean3 n^-bearins  word  in  the  "'L,  than  the  correct sn ,c e  recjuirrod  for 
ultimate  interpretation  c  '  the  terv.  is  not  linguistic  insight  ur-  skill,  but: 


■ 


MICRQSEIIAKTICS 
(Victor  Aa  OswaM*  Jre) 

As  I  have  said  before,  I  am  persuaded  that  we  must  devise  the 
product  of  MT  in  some  fashion  that  will  males  j.t3  at  the  conclusion  of 
our  process,  snore  or  less  immediately  intelligible  to  a  monolingual 
specialist  in  the  field  to  which  the  translated  text  pertains  <>  A  general 
pre~editor  (monolingual  in  the  FL)  can  perform  all  sorts  of  useful  tasks? 
as  you  will  see,  I  should  like  to  assign  him  among  other  duties  that  of  in- 
structing the  machine  where  to  find  the  fracture  surfaces  of  German  com- 
poundSo  A  general  post-editor  (monolingual  in  the  TL)  should  be  required 
to  shape  the  ends  the  machine  has  rough-hewn,  in  particular  to  smooth  out 
the  word  order  where  the  machine  may  have  jumbled  ito  Above  all,  I  should 
want  to  assign,  to  a  post-editor  the  task  of  choosing  the  most  satisfactory 
preposition  from  the  battery  we  shall  have  to  provide  in  the  TL  for  each 
preposition  in  the  FL,  and  likewise  to  choose  the  appropriate  equivalent 
from  among  the  less  formidable  array  of  multiple  choices  for  conjunct. ions,, 
A  monolingual  general  pre-editors  however,  cannot  efficiently  aid  in  the 
interpretation  of  meaning-bearing  words  that  have  diverse  significance  in 
diverse  context  s<>  A  monolingual  post-editor  will  inevitably  be  confronted 
with  all  sorts  of  contexts  with  which  he  is  not  familiar,  Bilinguals,  as  I 
have  pointed  out,  will  be  only  a  little  better  equipped  to  cope  with  the 
problem,.  For,  if  we  assume  we  can  resolve  patterns  of  syntactic  connection 
from  the  FL  into  the  TL  —  as  I  am  sure  we  can  ~  and  if  we  assume  that  vie 
can  supply  at  a  reasonable  speed  all  the  possible  significances  in  the  TL  for 
each  meaning-bearing,  word  in  the  FL,  then  the  competence  required  for  the 
ultimate  interpretation  of  the  text  is  not  linguistic  insight  or  skill,  but 
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S..H.  Rio!:-"-  and  ft..B.  Booth 

.-„  lisez,  llsez;  jetes  vos  granmaires 
au  feu. 

Schlegel , 

The  following  outline  summarizes  some  suggestions  or.  ways  in  vhich 
translation  nay  be  :  -   i  id,  worked  out  by  the  authors  in  1948-52.  The 
treatment  is  not  axhaustive  and  nary  modifications  in  detail  would  certainly 
be  necassary  should  the  ideas  here  mooted  be  put  into  active  operation.  It 
is  considered  however  that  it  would  be  serviceable  to  describe  the  techniques 
here  envisaged  in  their  present  rudimentary  form,  rather  bhan  -.-..ait  their 
further  refining,  zLcv,a   they  any  prove  of  interest  to  others  working  along 
the  same  linos „ 

.  paper  is  cirid;;cl  into  three  parts.  The  first  deals  with  general 
principles,  the  c  -c:;'  with  spscimcn  translations,  while  the  third  considers 
in  detail  the  working  of  so:ao  specific  mechanical  translation  schedules. 

A  language  is  a  series  of  symbols  representing  ideas.  The  simplest 
conceivable  -written  language  would  have  one  symbol  per  idea,  together  with 
appropriate  rales,  possibly  involving  eztra  symbols  for  syntactical  relations, 
Such  a  language  does  not  exist.  The  nearest  approach  is  probably  Chinese 
or  the  nucerical  notation  in  general  use-  In  both  these  cases  the  symbols 
represent  ideas  directly,  Usually,  written  languages  symbolise.,  not  ideas, 
but  other  symbols,  namely  sounds,  spoken  words,  that  represent  ideas.   Since, 
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Samples  arc  given  in  this  section  of  the  sorts  of  translation  to  be  ejected  . 
from  the  application  of  the  above-mentioned  procedures  to  specific  languages,,  The 
sentences  chosen  have  been  selected  at  rac-Icm  from  the  biological  literature  in 
these  languages,  merely  avoiding  sentences  with  proper  names  or  numerical  data0 
For  convenience  of  type-setting,  the  samples  are  taken  only  from  lani-fuages  normally 
written  in  Reman  script,  but  sentences  from  too  oriental  languages,  Arabic  and  Japanese, 
have  been  transliterated  to  illustrate  further  point rs^ 

To  illustrate  the  relative  unimportance  of  syntax,  many  near-vacuous  mords,  in 
particular  the  articles,  have  been  treated  a3  vacuous,  and  no  indication  is  given  of 
grammatical  forms  indicating  the  singular  number,  third  person,  or  present  tense 0 
The  resultant  translation  thus  combines  an  Snglish  vocabulary  nith  a  granHsar  or  lack 
of  grammar  more  characteristic  of  C:       It  is  probable  that  a  sumowhat  less  ea=> 
treme  treatment  of  near-vacuous        ...  be  more  useful  :li     '.ce0 
The  grammati*     >reviat:         re  as  fa 
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